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Penultimate Proline Abstract. The influence of the position of the amino acid proline in polypeptide 
sequences is examined by a combination of ion mobility spectrometry-mass spec¬ 
trometry (IMS-MS), amino acid substitutions, and molecular modeling. The results 
suggest that when proline exists as the second residue from the N-terminus (i.e., 
penultimate proline), two families of conformers are formed. We demonstrate the 
existence of these families by a study of a series of truncated and mutated peptides 
derived from the 11-residue peptide Serhpro 2 -Glu 3 -Leu 4 -Pro 5 -Ser 6 -Pro 7 -Gln 8 -Ala 9 - 
Glu^-Lys 11 . We find that every peptide from this sequence with a penultimate proline 
CCS (A 2 ) residue has multiple conformations. Substitution of Ala for Pro residues indicates that 

multiple conformers arise from the cis-trans isomerization of Xaa 1 -Pro 2 peptide bonds 
as Xaa-Ala peptide bonds are unlikely to adopt the cis isomer, and examination of spectra from a library of 58 
peptides indicates that -80% of sequences show this effect. A simple mechanism suggesting that the barrier 
between the cis- and trans- proline forms is lowered because of low steric impedance is proposed. This observa¬ 
tion may have interesting biological implications as well, and we note that a number of biologically active peptides 
have penultimate proline residues. 
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Introduction 

hile examining the influence of the position of different 
amino acid residues in polypeptide sequences on peptide 
conformation, we find evidence that proline in the second position 
from the amino-terminus (i.e., penultimate proline) gives rise to 
two types of peptide families. This penultimate proline has a 
propensity to induce two types of stable conformations, having 
mostly similar populations. Prior work has noted the existence of a 
penultimate proline in a large number of similarly sized signaling 
peptides [1-4]. It has been proposed that proline in this position 
protects the small amino acid chain against further enzymatic 
degradation [1-3]. The evidence presented here that two popula¬ 
tions of conformers are favored suggests that penultimate proline 
may play another role involving multiple structures that may 
influence activity and binding. 
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Cis-trans isomerization of peptide bonds is one mechanism 
that can lead to large changes in the overall structure of peptides 
and proteins [5-8]. Such isomers are most commonly associated 
with proline, primarily because of the 20 natural amino acids 
proline is unique in that it has a secondary amine at the backbone 
nitrogen atom [9, 10]. This makes it possible for proline to adopt 
the cis isomer of Xaa-Pro peptide bonds (where Xaa is any amino 
acid) at a higher frequency than Xaa-Xnp peptide bonds (where 
Xnp is any non-proline amino acid) [9, 10]. Although most 
previous studies classify proline residues either as cis or trans 
configurations, an increasing number of systems appear to adopt 
structures that utilize both the cis and trans isomers, and the two 
forms may have different functions [11]. Thus, it is important to 
understand the fundamental factors that influence the ability of 
Xaa-Pro bonds to adopt both cis and trans isomers. 

In this study, we use a combination of electrospray ioniza¬ 
tion (ESI) [12], ion mobility spectrometry (IMS) [13-17], and 
mass spectrometry (MS) to examine the conformation of a 
range of peptide sequences. The mobility of an ion through a 
buffer gas depends upon the shape of the ion, and thus is an 
effective means of delineating large differences in conforma¬ 
tion. When combined with ESI and MS, it is possible to sample 
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different conformer populations directly from solution [18-25]. 
Because of the ability to separate conformations that arise from 
both cis and trans configurations, IMS-MS is an appealing 
technique to analyze cis-trans isomerization of proline- 
containing peptides. Previous IMS-MS studies have 
established the important role that proline plays in the confor¬ 
mations of polypeptide ions [26-29]. Pierson et al. have shown 
by IMS-MS that proline residues are important in establishing 
multiple conformations of the nonapeptide bradykinin [27]. 

In this paper, we primarily examine the influence of proline 
position on the distribution of conformations observed by IMS- 
MS for the peptide Ser 1 -Pro 2 -Glu 3 -Leu 4 -Pro 5 -Ser 6 -Pro 7 -Gln 8 - 
Ala 9 -Glu 10 -Lys n —a tryptic peptide found in several proteins 
across different organisms. For example, it is located at residues 
271-281 in the CLPB Protein in Homo sapiens [30]. We are 
interested in studying this peptide in detail because it has multiple 
peaks in the ion mobility distribution of the [SPELPSPQAEK + 
2H] 2+ ion. Examination of this peptide (and related shorter se¬ 
quences formed as by-products of the synthesis) motivated us to 
analyze a library of doubly charged peptides, which was available 
in our laboratory; these studies show that -80% of all sequences 
that contain a penultimate proline adopt multiple conformers. 
Thus, it appears to be a common feature, in at least some families 
of sequences. We propose a simple mechanism in which the 
barrier of the system is lower because of decreased steric imped¬ 
ance associated with the penultimate position to explain the 
results. Overall, these findings expand upon previous studies 
[26,27] and help to illuminate the more complex role that proline 
plays in establishing peptide conformations. 

Experimental 

Ion Mobility Spectrometry-Mass Spectrometry 

IMS-MS experiments were performed on a home-built, ~2-meter 
instrument described in detail previously [31, 32]. Peptides were 
electrosprayed from 49.5:49.5:1.0 water:acetonitrile:formic acid 
solutions. Ions were produced by electrospray ionization with a 
Triversa Nanomate (Advion Bioscience, Inc., Ithaca, NY, USA). 
Ions are stored in a Smith-geometry hourglass-shaped ion funnel 
[33]. Packets of ions are periodically pulsed into the drift tube, 
which is operated with 3.00±0.02 Torr He buffer gas and an 
electric field of 10 V-cnT 1 . Ions exit the drift tube through a 
differentially pumped region and are analyzed by an orthogonal 
geometry two-stage reflectron time-of-flight mass spectrometer in 
a nested fashion [34]. 

Calculating Collision Cross Sections 

Drift time (t D ) distributions can be converted to collision cross 
section ( Q ) distributions according to [35], 

' 1 1 y /2 t D E 760 T 1 

M + Mb\ ~L~ P 273.2 V’ 

(1) 


where ze is the charge of the ion. K b is Boltzmann’s 
constant. T and P are the temperature and pressure, 
respectively. E and L are the electric field and length 
of the drift tube, respectively. N is the neutral number 
density at STP. For the instrument used in this study, 
collision cross sections can be calculated two ways. First, 
we can measure the time it takes the ions to traverse the 
first region of the drift tube from the source gate to the 
selection gate at the middle funnel and use Equation 1 
directly. Alternatively, we can use the time required for 
ions to traverse the entire drift tube. However, the ion 
funnels are not operated with a uniform electric field. 
Thus, we must calibrate measurements to well-studied 
systems such as bradykinin. Cross section values obtain¬ 
ed from both methods typically agree within 1%. This 
allows for the conversion of drift time distributions to 
accurate collision cross section distributions. 

In order to compare collision cross section distributions of 
peptides in which Ala residues have been substituted for Pro 
residues, we need to account for the intrinsic size difference 
between the side chains. Previous efforts have focused on 
developing intrinsic size parameters (ISPs) to provide insight 
into the average contribution each amino acid residue has to the 
collision cross section of peptide ions [36-38]. From this work, 
it was determined that the substitution of Ala for Pro residues 
leads to peptides with smaller cross sections on average. To 
account for this intrinsic difference in size, we have shifted our 
collision cross section distributions by 3.5 A 2 for each Pro —► 
Ala substitution. This shift in cross section is in good agree¬ 
ment with work by Pierson et al. in which triply charged 
bradykinin distributions were shifted by 2.5 A 2 for each Ala 
substitution. It is worth mentioning that previous work suggests 
ISPs are dependent on charge state [36-38]. Furthermore, the 
3.5 A 2 correction per Pro —► Ala substitution yields more 
precisely aligned distributions for the peptides analyzed in this 
study and both values lead to the same cis and trans isomer 
assignments. 

Peptide Synthesis 

Peptides were synthesized by Fmoc solid-phase synthesis on 
an Apex 396 peptide synthesizer (AAPPTec, Louisville, KY, 
USA) similar to the method described previously [27]. Fmoc 
side-chain protected amino acids and Wang-type polystyrene 
resins were used (Midwest Biotech, Fischers, IN, USA). 
Twenty percent piperidine in dimethylformamide was used 
for N a deprotections. l,3-Diisopropylcarbodiimide/6-chloro- 
1-hydroxybenzotriazole or 3-(diethoxyphosphoryloxy)-l,2,3- 
benzotriazin-4(3H)-one was used as the coupling reagent. 
Peptides were cleaved from the resin with a trifluoroacetic 
acid:triisopropylsilane:methanol solution at a 18:1:1 ratio, and 
5% 2,2'-(ethylenedioxy)diethanethiol was added for 
methionine-containing peptides. Peptides were washed and 
precipitated in diethyl ether. Precipitated products were dried 
down with a vacuum manifold before reconstituting in H 2 0 
and finally the ESI solvent. 
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Molecular Modeling 

Molecular modeling simulations were completed by a simulat¬ 
ed annealing procedure with the Insight II software package 
(Accelrys, Inc., San Diego, CA, USA) similar to simulations 
described previously [26]. Simulations were performed on the 
[SPQAEK + 2H] 2+ ion with charges assigned to the N-terminus 
and the Lys side chain. Simulated annealing was completed on 
two initial starting structures that differ by geometry of the 
Ser'-Pro 2 peptide bond (cis or trans). Ions were heated from 
298 to 500 K over 5 ps, equilibrated at 500 K for 10 ps, cooled 
to 298 K over 5 ps, and equilibrated at 298 K for 30 ps. The 
extensible systematic force field (ESFF) and a dielectric of 1 
were used; 100 trial geometries were obtained from each run. 
Collision cross sections were calculated using the trajectory 
method in Mobcal [39]. 


Results and Discussion 

IMS-MS of SPELPSPQAEK and Truncated 
Peptides 

Figure 1 shows the two-dimensional IMS-MS plot of ions 
produced by ESI of the cmde product from the synthesis of 
the peptide SPEEPSPQAEK. In addition to the 
[SPEEPSPQAEK + 2H] 2+ ion, we observe the truncated series 
of peptides (e.g., [PEEPSPQAEK + 2H] 2+ ) derived from the 
full-length sequence in relatively high abundance. Truncated 
peptide sequences result from the inefficiency of peptide syn¬ 
thesis. Although inefficient synthesis is generally thought to be 
problematic, this makes it possible to obtain IMS-MS measure¬ 
ments for many unique peptide sequences from a single syn¬ 
thesis. From this series of tmncated peptides, we observed an 
interesting trend—as single amino acid residues are added from 
the 5 residue peptide to the full-length sequence, the distribu¬ 
tion of conformers for each peptide is markedly different. That 
is, some peptides have multiple conformers and others do not. 
Upon this discovery, we synthesized each individual tmncated 
peptide derived from the SPEEPSPQAEK separately to probe 
this observation in detail. It is worth mentioning that distribu¬ 
tions obtained from the two different syntheses are nearly 
identical. 

Figure 2 shows the collision cross section distributions for 
the peptide [SPELPSPQAEK + 2H] 2+ ion and tmncated pep¬ 
tides. The distribution has two peaks at 252 and 258 A 2 with 
normalized abundances of 23% and 77%, respectively. Based 
on previous studies of proline-containing peptides by ion mo¬ 
bility, one likely explanation for the multiple conformations is 
that one of the Xaa-Pro bonds adopts both the cis and trans 
isomer [26, 27]. However, it is difficult to assign what residue 
or region of residues in the peptide sequence are important to 
observing multiple conformer families from the cross section 
distribution alone. Thus, we will consider the tmncated ver¬ 
sions of the peptide to examine how the distributions of ions 
change as a function of length and amino acid composition. 



Figure 1. Two-dimensional IMS-MS plot of crude synthe¬ 
sis product of the peptide SPELPSPQAEK electrosprayed 
from 49.5:49.5:1 water:acetonitrile:formic acid. Doubly 
charged [M + 2H] 2+ truncated sequences of the full- 
length peptide are labeled 


Collision cross section distributions for the tmncated pep- 
tides [PELPSPQAEK + 2H] 2+ , [ELPSPQAEK + 2H] 2+ , 
[LPSPQAEK + 2H] 2+ , [PSPQAEK + 2H] 2+ , [SPQAEK + 
2H] 2+ , and [PQAEK + 2H] 2+ are displayed in Figure 2. It is 
apparent that the removal of a single amino acid from the N- 
terminus dramatically influences the distribution of con¬ 
formers. The distribution of [PELPSPQAEK + 2H] 2+ is mark¬ 
edly different than the full-length sequence. Although both 
features contain multiple peaks, the minor feature for the 
[PELPSPQAEK + 2H] 2+ ion is less than 1 % normalized abun¬ 
dance with the major feature at 249 A 2 . This trend of observing 
different distributions for each subsequent tmncation is ob¬ 
served across all sequences. The distribution for the 
[ELPSPQAEK + 2H] 2+ ion is dominated by a single peak at 
231 A 2 . The removal of the Glu residue from the N-terminus 
leads to two highly abundant peaks for the [LPSPQAEK + 
2H] 2+ ions at 213 and 217 A 2 . Once again, the removal of the 
N-terminal amino acid changes the distribution as only a single 
peak at 185 A 2 is observed for the [PSPQAEK + 2H] 2+ ion. 
The distribution for the [SPQAEK + 2H] 2+ ion has two features 
located at 172 and 178 A 2 . Finally, the [PQAEK + 2H] 2+ ion 
has a single peak at 164 A 2 . 

It is interesting that we do not observe a specific length at 
which all subsequent peptides have similar conformer distribu¬ 
tions for the sequences analyzed in this study. This suggests 
that each amino acid residue influences the conformation, and 
small modifications such as the addition of a single amino acid 
to the N-terminus can lead to global conformational changes. 
Distributions oscillate between being dominated by a single 
peak and multiple peaks as the sequence is extended. The 
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Figure 2. Collision cross section distributions for doubly 
charged peptide [M + 2H] 2+ ions (where M = SPELPSPQAEK, 
PELPSPQAEK, ELPSPQAEK, LPSPQAEK, PSPQAEK, 
SPQAEK, and PQAEK). Distributions were obtained from IMS- 
MS measurements by integrating the drift bins for a narrow m/z 
window for each ion of interest 

[SPELPSPQAEK + 2H] 2+ , [LPSPQAEK + 2H] 2+ , and 
[SPQAEK + 2H] 2+ ion distributions have multiple conforma- 
tions observed in relatively high abundance, whereas the 
[PELPSPQAEK + 2H] 2+ , [ELPSPQAEK + 2H] 2+ , [PSPQAEK 
+ 2H] 2+ , and [PQAEK + 2H] 2+ ion distributions are dominated 
by a single peak. 

One commonality between the sequences with multiple 
features, [SPELPSPQAEK + 2H] 2+ , [LPSPQAEK + 2H] 2+ , 
and [SPQAEK + 2H] 2+ , is the presence of a Pro 2 residue. 
However, when the N-terminal residue is removed such that 
Pro is at position 1, the distribution is dominated by a single 


peak. This suggests that the cis-trans isomerization of Xnp 1 - 
Pro 2 peptide bonds allows for multiple conformer families and 
not the more centrally positioned residues. It is important to 
point out that the cis-trans isomerization of proline occurs at 
the peptide bond preceding proline (i.e., Xaa-Pro). Therefore, 
Pro 1 residues are unable to adopt both cis and trans isomers 
and, consequently, multiple conformers. 

IMS-MS Analysis of Pro Substituted Peptides 

The analysis of the truncated peptides has shown that three 
sequences, [SPELPSPQAEK + 2H] 2+ , [LPSPQAEK + 2H] 2+ , 
and [SPQAEK + 2H] 2+ , have multiple conformer families at 
relatively high abundance. Thus, we will focus on determining 
the amino acid residues that are important to stabilizing multi¬ 
ple conformations for these sequences. This work was inspired 
by previous IMS-MS studies by Pierson et al. in which Ala 
residues were substituted for Pro residues in the nonapeptide 
bradykinin (BK) [27]. Substituting Ala for Pro residues likely 
fixes the Xaa-Ala peptide bond in the trans isomer. It was 
shown that the cis-trans isomerization of proline residues are 
important for triply charged [BK + 3H] 3+ ion conformations 
and specific Xaa-Pro bonds were assigned as cis or trans for 
different conformations. 

SPELPSPQAEK Pro —> Ala Substitutions 

Figure 3 shows the collision cross section distributions for the 
Pro -► Ala analogues of the [SPELPSPQAEK + 2H] 2+ ions. 
Throughout this manuscript, cross section distributions for Pro 
—> Ala substituted peptides have been corrected for the size 
difference between proline and alanine as explained above. The 
cross section distribution for the Pro 2 —► Ala substitution is 
dominated by a single peak at 252 A 2 . This suggests that the 
cis-trans isomerization of Ser ! -Pro 2 peptide bond of 
[SPELPSPQAEK + 2H] 2+ ion is crucial for the conformation 
at 258 A 2 . Furthermore, it supports the results from the trun¬ 
cated series that suggest Pro 2 residues are important for multi¬ 
ple conformers. 

In order to verify that the N-terminal region of the peptide 
leads to multiple structures, we have analyzed the Pro 5 and Pro 7 
substituted peptides. Both [SPELASPQAEK + 2H] 2+ and 
[SPELPSAQAEK + 2H] 2+ distributions have multiple peaks 
(Figure 3). The collision cross sections of the major peak for 
both the Ala 5 and Ala 7 peptides is 260 A 2 . The conformers at 
260 A 2 are less than 1% larger than the major peak at 258 A 2 
for the peptide [SPELPSPQAEK + 2H] 2+ ion. Although we 
observe a peak near 258 A 2 when both Pro 5 and Pro 7 are 
substituted, this peak is not observed for the Pro 2 substitution. 
Therefore, we assign the conformation at 258 A 2 as cis- Pro 2 , 
trans- Pro 5 , trans- Pro 7 . A summary of cis-trans isomer assign¬ 
ments and populations can be found in Table l.tgroup 

As mentioned above, both the distributions for the Pro 5 —> 
Ala and Pro 7 —> Ala sequences have minor features. Although 
our data suggests that the Pro 5 and Pro 7 residues are not the 
sites key to having multiple peaks, they dramatically influence 
the distribution of conformations. The minor features for the 
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Figure 3. Collision cross section distributions for Pro 2 -► Ala, 
Pro 5 -> Ala, Pro 7 -► Ala, and Pro 2,5,7 -► Ala substituted peptides 
from the [SPELPSPQAEK + 2H] 2+ ions. Substituted distribu¬ 
tions are shown in red and the natural sequence is shown as a 
dotted trace for ease of comparison. Collision cross section 
distributions for substituted peptides are adjusted for the size 
difference between Ala and Pro as explained in text 

Ala 5 substituted peptide is 248 A 2 . The distribution for the Ala 7 
substituted peptide has two minor features at 251 and 255 A 2 . 
In addition to the difference in the number of peaks and their 
respective cross sections, the populations are also different. 
Several previous studies have focused on the local environment 
surrounding proline residues with significant attention to the 
identity of the residue N-terminally adjacent to proline [40,41]. 
A recent study suggests that the nonlocal environment influ¬ 
ences cis -Pro residues in proteins found in the Protein 
Databank [42]. Our results suggest that distal residues 


influence the population of cis-trans isomers for peptide ions. 
One possible explanation for this is that intramolecular inter¬ 
actions that stabilize the structure of the peptide change upon 
substituting Ala for Pro. The backbone nitrogen atoms of 
proline residues are unable to participate in intramolecular H- 
bonding because of the lack of hydrogen bond donor on the 
backbone nitrogen. However, Ala residues are able to form 
intramolecular H-bonds. 

We have performed the triple substitution, Pro 2,5,7 —> Ala, 
as this peptide should represent the all trans peptide. The cross 
section distribution for the triply substituted peptide has a 
single peak located at 254 A 2 , less than 1% larger than the 
peak at 252 A 2 for the peptide [SPELPSPQAEK + 2H] 2+ ion 
(Figure 3). Therefore, we have confirmed the conformer at 252 
A 2 for the [SPELPSPQAEK + 2H] 2+ ion as the trans config¬ 
uration at Pro 2 , Pro 5 , and Pro 7 (Table 1). 

LPSPQAEK Pro —> Ala Substitutions 

Figure 4 shows the collision cross section distribution for the 
Pro —> Ala substituted peptides for the [LPSPQAEK + 2H] 2+ 
ions. Upon Pro 2 —► Ala substitution, we observe a distribution 
that is dominated by a single conformation at 215 A 2 . Although 
a minor feature is observed at 212 A 2 , it comprises less than 1 % 
of the distribution. In contrast, the Pro 4 —> Ala [LPSAQAEK + 
2H] 2+ distribution is strikingly similar to the [LPSPQAEK + 
2H] 2+ ion distribution, having two peaks at 213 and 216 A 2 of 
nearly equal abundance. This suggests that the Pro 2 residue is 
important to observing multiple conformer families of nearly 
equal abundance. The conformer at 213 A 2 is assigned as cis- 
Pro 2 , trans- Pro 4 (Table 1). Due to the presence of a peak -217 
A 2 for both the Pro 2 —► Ala and Pro 4 —> Ala distributions, this 
conformer is assigned as trans- Pro 2 and trans- Pro 4 (Table 1). 

In order to confirm the cis-trans isomer assignment, we 
have analyzed the doubly substituted peptide Pro 2,4 —> Ala. 
As shown in Figure 4, the cross section distribution for the 
doubly substituted Pro 2,4 —> Ala peptide has a single peak at 
217 A 2 . [LASAQAEK + 2H] 2+ likely represents the all -trans 
peptide. This provides further evidence that conformer at 217 
A 2 for [LPSPQAEK + 2H] 2+ ions is trans- Pro 2 , trans- Pro 4 
(Table 1). 

SPQAEK Pro —> Ala Substitutions 

Figure 5 shows the cross section distribution for the Pro 2 —> 
Ala substituted peptide [SAQAEK + 2H] 2+ . Upon substitution, 
the distribution has a single peak at 177 A 2 . This peak is 
aligned with the minor feature located at 178 A 2 of the 
[SPQAEK + 2H] 2+ distribution. Because this peptide contains 
a single Pro residue, the assignment is relatively straightfor¬ 
ward. The Se^-Pro 2 peptide bond for the conformer at 178 A 2 
is assigned as trans because a similar conformer is observed for 
the [SAQAEK + 2H] 2+ , which likely has a trans-Ser'-Ala 2 
peptide bond. The conformer at 172 A 2 is assigned as a cis- Pro 2 
peptide bond because it is not observed in the [SAQAEK + 
2H] 2+ distribution (Table 1). 
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Table 1. Proline Cis-Trans Isomer Assignments 


Sequence 

Collision cross section (A 2 ) 

Normalized abundance (%) a 

Pro 2 

Pro 4 

Pro 5 

Pro 7 

SPELPSPQAEK 

258 

76.9 

Cis 

_ 

Trans 

Trans 

SPELPSPQAEK 

252 

23.1 

Trans 

- 

Trans 

Trans 

LPSPQAEK 

213 

57.0 

Cis 

Trans 

- 

- 

LPSPQAEK 

217 

43.0 

Trans 

Trans 

- 

- 

SPQAEK 

172 

83.4 

Cis 

- 

- 

- 

SPQAEK 

178 

16.6 

Trans 

- 

- 

- 


Peak abundances are the integrated area for each respective peak normalized to the total area for each collision cross section distribution shown in Figure 2. 


Examination of a Penultimate Proline-Containing 
Peptide Library 

In order to determine if the trend of peptides with a penultimate 
proline residue adopting multiple conformers is widely ob¬ 
served, ion mobility distributions were measured for 52 addi¬ 
tional [M + 2H] 2+ peptides with a penultimate proline. IMS- 
MS data was obtained from a library of synthesized peptides in 
a similar method to that shown in Figure 1, including both 
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Figure 4. Collision cross section distributions for Pro 2 -► Ala, 
Pro 4 -* Ala, and Pro 2,4 -+ Ala substituted peptides from the 
[LPSPQAEK + 2H] 2+ ions. Substituted distributions are shown 
in red and the natural sequence is shown as a dotted trace for 
ease of comparison. Collision cross section distributions for 
substituted peptides are adjusted for the size difference be¬ 
tween Ala and Pro as explained in text 


tmncated and full-length sequences. Figure 6 shows collision 
cross section distributions for several [M + 2H] 2+ peptides from 
this dataset. In both examples, we observe a single peak in the 
mobility distribution for the Pro 1 peptide, multiple peaks of 
considerable abundance for the Pro 2 peptides, and a distribu¬ 
tion that is dominated by a single peak for the Pro 3 peptides. 

If we consider all penultimate proline-containing peptides 
examined in this study, 46 of 58 (79%) peptide sequences have 
multiple conformations in the IMS distribution. The complete 
list of sequences is listed in Supplementary Information (Table 
SI). Previous IMS-MS measurements by Counterman et al. 
found that 57% of proline-containing peptides had multiple 
features in the IMS-MS distribution [26]. It is worth mention¬ 
ing that the previous measurement included Pro 2 peptides. This 
suggests that penultimate proline-containing peptides adopt 
multiple conformations at a relatively high frequency across a 
range of peptide lengths and sequence compositions. 

Investigation into Cis- versus Trans-Stabilizing 
Intramolecular Interactions by Molecular Modeling 

Preliminary molecular modeling simulations were performed to 
provide insight into the differences in the intramolecular interac¬ 
tions that exist when Xaa-Pro bonds adopt the cis versus trans 
configuration. Simulations were performed on the [SPQAEK + 
2H] 2+ ion. This peptide was chosen because it has two well- 
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Figure 5. Collision cross section distributions for Pro 2 -* Ala 
from the [SPQAEK + 2H] 2+ ions. Substituted distributions are 
shown in red and the natural sequence is shown as a dotted 
trace for ease of comparison. Collision cross section distribu¬ 
tions for substituted peptides are adjusted for the size differ¬ 
ence between Ala and Pro as explained in text 
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Figure 6. Collision cross section distributions for a series of [M + 2H] 2+ peptides containing Pro 1 , Xaa^Pro 2 , and Xaa^Xaa^Pro 3 
sequences 


resolved peaks in the cross section distribution, contains a single 
proline residue at position two, and is relatively small. Therefore, 
it provides an excellent model to probe the intramolecular inter¬ 
actions that stabilize both the cis and trans geometries of proline 
when located at the second position as shown in the experimental 
results. 

Figure 7 shows two representative low-energy structures 
obtained from molecular modeling simulations. We selected 
the lowest energy structure for the c/s-Pro isomer and a rela¬ 
tively low energy structure with similar deviation from the 
experimental results for the trans-Pro isomer. Collision cross 
section values are within 2.0% of the cross sections measured 
by IMS-MS; the modeled czs-Pro (Figure 7a) and trans -Pro 
(Figure 7b) conformers have calculated cross sections of 175 
and 181 A 2 , respectively. Several general similarities exist 



between the two modeled structures—the Gin, Glu, and Lys 
side chains interact with the peptide backbone and the N- and 
C-terminus interact with each other. However, one key differ¬ 
ence appears to be the disparity in the interactions with the 
oxygen atom of the second carbonyl group of the Pro 2 residue. 
The cis-Pro peptide has an interaction between the protonated 
N-terminus and the carbonyl oxygen of Pro 2 . Conversely, the 
trans-Pro peptide has an interaction between the protonated 
Lys side chain and the carbonyl oxygen of Pro 2 . Therefore, the 
interaction of the protonated N-terminus and the carbonyl of 
the Pro 2 side chain may stabilize the cis geometry of the Ser 1 - 
Pro 2 peptide bond. This may also explain why we did not 
observe similar distributions for conformations after an amino 
acid is added to the N-terminus such that proline is in the third 
position. Upon addition of amino acids to the N-terminus, this 


(b) 



Figure 7. Representative structures of the [SPQAEK + 2H] 2+ ions with Ser'-Pro 2 peptide bond in the cis (a) and trans (b) forms 
obtained from molecular modeling simulations. Amino acid side chains, terminal regions, and the geometry of the Se^-Pro 2 bond 
are labeled for clarity. H-bonds determined using the recommended relaxed constrains in Chimera are represented as dotted lines 
[43] 
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interaction may become less energetically favorable and a 
single conformation may become preferred. 

Simple Mechanistic Consideration 

In addition to the intramolecular interactions that are key to 
penultimate proline residues adopting multiple conformations 
mentioned above, it is worthwhile to consider potential mech¬ 
anistic explanations for the preference of penultimate proline 
residues to adopt multiple conformations. One likely explana¬ 
tion is that the low steric impedance of rotating the Xaa '-Pro 2 
bond leads to a relatively low barrier between the cis and trans 
isomers. This allows the peptide to rotate more freely between 
the cis and trans isomers, making the tail region more flexible 
to adopt multiple conformers. For more internally located 
proline residues, the barrier to rotate larger regions of the 
peptide surrounding Xaa-Pro bonds may be increased com¬ 
pared with the terminal region. Also, intramolecular interac¬ 
tions surrounding the peptide may more effectively lock the 
Xaa-Pro peptide bond into a single isomer. We cannot rule out 
that a similar effect would be observed for proline residues 
located at the C-terminus. However, we have only studied 
tryptic peptides to date; therefore, all sequences analyzed ter¬ 
minate in Lys or Arg residues. 

Conclusions 

IMS-MS techniques and molecular modeling were used to 
examine the conformation families in a series of truncated 
and Pro —> Ala substituted analogues of the peptide 
[SPELPSPQAEK + 2H] 2+ ion. Our results are in good agree¬ 
ment with previous IMS-MS studies that suggest proline resi¬ 
dues are important for multiple conformations [26, 27]. How¬ 
ever, for this system, all proline residues are not equally im¬ 
portant. We find a preference for Pro 2 residues to adopt both 
the cis and trans isomer, resulting in multiple conformer fam¬ 
ilies observed by IMS-MS. We find this trend to be true across 
a variety of peptide sequences; ~80% of all penultimate 
proline-containing peptides analyzed have multiple stable con¬ 
formations. This is not meant to imply that only penultimate 
proline residues can adopt multiple conformations; previous 
IMS-MS studies have observed Pro residues located at other 
positions in peptides with multiple conformations [26, 27]. 
Rather, we suggest that this position may have an intrinsic 
ability to stabilize both cis- and /Ams'-Xaa'-Pro 2 isomers. This 
is likely due to low steric impedance of the N-terminal region, 
making this region of the peptide flexible and allowing for 
isomerization between the cis and trans forms of Xaa'-Pro 2 
peptide bonds. 

Finally, it is interesting to consider the biological implica¬ 
tions of Pro 2 establishing multiple conformations. Proline is 
commonly found in the second position in many biological 
systems, including neuro- and vasoactive peptides [2, 4]. Al¬ 
though the high frequency of Pro 2 residues is often hypothe¬ 
sized to stem from the ability of Pro residues to protect the N- 
terminal region from enzymatic degradation [1-3], it is 


possible that if Xaa^Pro 2 peptide bonds have a propensity to 
adopt both cis and trans isomers, this may have additional 
implications in peptide signaling. 
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